Fluctuation-driven dynamics of the Internet topology 
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We study the dynamics of the Internet topology based on empirical data on the level of the autonomous sys- 
tems. It is found that the fluctuations occurring in the stochastic process of connecting and disconnecting edges 
are important features of the Internet dynamics. The network's overall growth can be described approximately 
by a single characteristic degree growth rate ^cft ~ 0.016 and the fluctuation strength OcH ~ 0.14, together 
with the vertex growth rate ct ~ 0.029. A stochastic model which incorporates these values and an adaptation 
rule newly introduced reproduces several features of the real Internet topology such as the correlations between 
the degrees of different vertices. 

PACS numbers: 89.70.+C, 89.75.-k, 05.10.-a 
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Recently many studies on complex systems ^ g] paid at- 
tention to complex networks An interesting feature 
emerging in such complex systems is a power-law behavior 
in the degree distribution, Puik) ~ k^'^ |[|, where the 
degree k is the number of edges incident upon a given ver- 
tex. Recently, Barabasi and Albert (BA) introduced an 
evolving network model to illustrate such networks, called the 
scale-free (SF) networks, in which the number of vertices N 
increases linearly with time, and a newly introduced vertex is 
connected to already existing vertices following the so-called 
preferential attachment (PA) rule. 

Huberman and Adamic (HA) [0] proposed another scenario 
for SF networks. They argued that the fluctuation effect aris- 
ing in the process of connecting and disconnecting edges be- 
tween vertices, is the essential feature to describe the dynam- 
ics of the Internet topology correctly. In this model, the total 
number of vertices N{t) increases exponentially with time as 

N{t) ^ N{0)exp{at). (1) 

Next, it is assumed that the degree ki at a vertex i evolves 



through the multiplicative process 1 20 1 



(2) 



where C,i{t) is the growth rate of the degree ki at time t, which 
fluctuates from time to time. Thus, one may divide the growth 
rate Q (t) into two parts. 



(3) 



where go,i is flie mean value over time, and ^i{t) the rest part, 
representing fluctuations [pT]]. ^^(t) is assumed to be a white 
noise satisfying {^t{t)) = and = <^o.i^t.t'Sij, 

where CTq ^ is the variance. Here (• • •) is the sample average. 
For later convenience, we denote the logarithm of the growth 
factor as Gi{t) = ln(l + Cii^))- Then a simple application of 
the central limit theorem ensures that ki{t)/ki{to), to being 
a reference time, follows the log-normal distribution for suffi- 
ciently large t. To get the degree distribution, one needs to col- 
lect all contributions from different ages t^, growth rates go^i, 
standard deviations CTo,i and initial degree ki{to). HA first as- 
sumed that Ci are identically distributed so that go.i = go and 



o'o.i = CTo for all i. Then the conditional probability for de- 
gree, P]:i{k,T\ ko), that ki — k at time t — to + t, given 
ki — ko a.t t — to is given by 



Pu{k,T\ ko) 
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(In (fc/fcp) - ffcffr) 
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(4) 

where g,s = (G,(t)) and = ((G,(t) - (G,(i)))'). ffoff 
and a^ff are related to go and ctq as ^off ~ o ~ "'o / 2 and cr^g « 
CTq, respectively [^. Since the density of vertices with age r 
is proportional to p{t) ~ exp(— ar), the degree distribution 
collected over all ages becomes 



PD(fc) - / dTpiT)PDik,T\ ko) ~ k- 



where 



7 



gcs 

2 

ff 



V.9off + 2aCT,"ff 



(5) 



(6) 



^cff 



Therefore, it is instructive to know the effective values of ^off 
and CTcff to determine the degree exponent. 

The Internet topology on the level of the autonomous sys- 
tems (AS) has been recorded by the National Laboratory for 
Applied Network Research (NLANR) [jl^] since November 
1997, which enables one to study its evolution. Here a node 
represents an AS, which is a unit of router policy in the Inter- 
net, consisting of either a single domain or a group of do- 
mains. Analysis of these data sets has been performed by 
several research groups. Some of the findings are as fol- 
lows: First, the numbers of vertices N{t) and edges L{t) 
increase exponentially with time [pl]|, and L(t) ^ N{t)^'^^ 
with 6 > 0, showing so-called the accelerated growth Jl^ . 
Second, the degree distribution follows a power-law with ex- 
ponent, 7 « 2.2±0.1 |[n|[l|]. Third, there occurs the PA 
behavior in the evolution process [p4|]. Lastly, there exist cer- 
tain non-trivial coiTelations between the degrees of different 
vertices |]l3|]. Meanwhile, Capocci et al. [|l5| considered the 
Internet on the levels of both the Internet Service Providers 
(ISPs) and hosts, in which the relative frequency of the cre- 
ation of the ISPs and hosts determines the degree exponent. 
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In this Letter, we analyze the empirical data of the Inter- 
net topology from the viewpoint of fluctuation-driven adaptive 
dynamics arising in the process of connecting and disconnect- 
ing edges. We find that the fluctuation effect is indeed essen- 
tial to the dynamics of the Internet topology. We measure the 
growth rate of vertices a and the effective values of gcft and 
(Tcff , and determine the degree exponent 7 using Eq.(Q), which 
is well compared with the directly measured value. Moreover, 
using the measured values, we construct a stochastic model 
following the HA idea. In addition, we include in our model 
an adaptive process that favors rewiring towards vertices with 
higher degree. The network structure constructed in this way 
reproduces the topological features of the real Internet, such 
as the correlations between the degrees of different vertices. 

Numerical analysis. In our analysis, the data are selected 
monthly, so that time is discretized with one month as a unit 
time step. We have analyzed the data from November, 1997 
through January, 2000, corresponding to t = and t = 26, 
respectively. First we examined the total number of vertices 
N{t) existing at time t, which grows exponentially with time 
as N{t) = iV(0) exp(ai) with a w 0.029(1) (see Fig^. The 
number of edges L{t) also increases exponentially with time 
as L(t) = L(0) exp(/3t) with (3 w 0.034(2) (see Fig.|l]), lead- 
ing to the relation L{t) = iV(i)i+^ with 9 w 0.16(4). While 
the total number of vertices increases with time, some vertices 
disappear from the data as time goes on due to permanent or 
temporary shutdown of the corresponding AS. Thus the num- 
ber of vertices No{t) introduced earlier than t — but still 
remaining at time t, decreases with time. Note that the de- 
creasing rate of No{t)is considerably reduced across t « 14 
(see the inset of Fig.|ll). The hub, the vertex with the largest 
degree, remains identical to the initial one throughout the pe- 
riod we studied. 

The average number of degree fc,icw(^) of the vertices newly 
introduced at time t fluctuates in time about a positive mean 
(fcnew)t ~ 1.34 with a standard deviation CTnow ~ 0.05, where 
(• • ■)t denotes the average over a time interval T. This result 



suggests that each newly introduced vertex connects to only 
one or two existing vertices, and internal links between exist- 
ing vertices are created actively as time goes on. So, the In- 
ternet becomes much more interwoven, and L{t) grows faster 
than N{t). 

We consider the dynamics of the degree ki (t) at each ver- 
tex i as a function of time. For convenience, we deal with only 
the vertices existing all the time from t — to t — 26, and 
the set composed of such vertices is denoted by S. Thus the 
degree growth rate of a vertex i, Gi{t) = ln(fci(t)//cj(i — 1)) 
is well defined, because ki{t) ^ for any vertex i Q S for 
all t. The measured value of Gi{t) fluctuates in time about 
a finite value. Let g, = {Gi)t and af = {{d - (G,)t)^)^. 
If the dynamics follows that of the HA model, a histogram 
of gi for many vertices would show the Gaussian distribution 
with mean gcs and variance a'^g/T. We find that {gi} show 
some correlations with the degree of the vertex. In particu- 
lar, the behavior of Gh {t) at the hub is interesting (see the 
inset of Fig.||). It is found that the fluctuation of Gh{t) is 
drastically reduced across t « 15, February, 1999. We ob- 
tain gh ~ 0.037(20) by averaging over the earlier period from 
t = 1 to t = 15, while gt ~ 0.031(6) over the later pe- 
riod from t = 16 to t = 26. Thus, the degree of the hub 
depends on N{t) as kfi{t) ^ N'^{t), where the exponent 77 
is related to gh as r/ — gji/a, exhibiting a crossover behav- 
ior from 77 « 1.3(1) to r/ w 1-0(1) as directly measured (see 
Fig.||). Note that r/ = 0.5 in the BA model. Despite the ap- 
parent correlation of the growth rate with degree, we do not 
attempt further analysis but rather focus on the distribution of 
{gi} below in accordance with the HA idea. Data shown in the 
inset of Fig. 1 and Fig. 2 suggest that the Internet topology has 
become much stabilized around i « 15. Therefore, we will 
use only the data of the later period for further discussions. 

We measured the mean growth rate gi and the correspond- 
ing standard deviation (Ti defined above for each vertex i 
(i e S) by taking average over the period from t — 16 to 
t — 26 (T — 10). The measured values {gi} (i e S) are 
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FIG. 1: Semi-logarithmic plot of the total number of vertices N{t) 
(•) and edges L{t) (o) as a function of time. The dashed and dotted 
lines obtained via the least-square-fit have slopes 0.029 and 0.034, 
respectively. Inset: Semi-logarithmic plot of No{t) versus t. 
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FIG. 2: Plot of the degree of the hub kh{t) versus the number of 
vertices N{t). Inset: Plot of the growth rate Gh{t) of the degree at 
the hub versus t. The dotted lines are the average values for given 
intervals. 
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distributed as shown in the inset of Fig.|[ In this inset, an ab- 
normal peak is located at 5 = 0, which is mostly contributed 
by the vertices whose degree is a few and never changes at all 
during the period we studied. Thus those vertices may be re- 
garded as the ones located at dangling ends in the graph, and 
be ignored for the dynamics of the Internet topology. In the 
inset of Fig.^ we fit the data other than the 5 = peak to the 
Gaussian form with mean g 0.016(2) and standard devia- 
tion (Tg « 0.04. On the other hand, the measured values {cr,} 
are also distributed but with small dispersion around the mean 
a w 0.12(6). In the HA model, g and ag would correspond to 
(/off and CToff / VT, respectively. 

It is most likely that g and a have a distribution among ver- 
tices. To obtain more accurate degree distribution, Eq.(4) has 
to be averaged over those distributions. Not knowing such de- 
tails, however, we try to approximate the growth process by a 
single process whose effective mean growth rate and standard 
deviation are gos and (Joff, respectively. For this purpose, we 
plot in Figj^ the distribution P{ki{t)/ki{to)) in terms of the 
scaled variables x and y defined as 

\n{h{t)/kdto))-gd(t-to) 
X = , (7) 

^2(7d2(i _ Iq) 

and 

y = P{k,{t)/h{to)){k,{t)/h{to))V^Trad^{t ~ to), (8) 

using a semi-logarithmic scale, where gd and ad are fitting 
parameters. We choose to — 0, and the data shown in Fig.^ 
are for times t > 15. It appears that the data for different 
times collapse onto the curve \ny = —x"^ reasonably well for 
small X with our best choice of = 0.016 and (Jd — 0.14 
as shown in Fig.|^. Larger deviations for large x are due to 
t being finite and are caused by the rare statistics of a few 



nodes whose degree increases by an anomalously large factor. 
The values gd and ad are close to g and a, respectively. Also, 
<yd/VT « 0.044 is consistent with ag. We also checked ad by 
measuring the variance of P{ki{t)/ki{tQ)) for each time, and 
plotting them as a function of time. The slope of the asymp- 
totic line in the plot corresponds to cr^. Using this method, we 
also obtain ad ~ 0.14(1), which is in agreement with the one 
obtained through the data-collapse method. Thus, the values 
g^g « 0.016 and acff ~ 0.14 may be regarded as the effective 
values representing the growth process as a single stochastic 
process. Applying those values to the formula Eq.(6), we ob- 
tain the degree exponent 7 w 2.1, which is in agreement with 
the directly measured one 7as ~ 2.2(1) [ [TTI , 

Stochastic model. Using the measured values, a, goff, and 
(Toff, and following the HA idea, we construct a stochastic 
model evolving through the following three rules: (i) Geo- 
metrical growth: At time t, geometrically increased number 
of new vertices, aN{t — 1), are introduced in the system, and 
following the fact (fcnow)t ~ 1.34, each of them connects to 
one or two existing vertices according to the PA rule, (ii) Ac- 
celerated growth: Each existing vertex increases its degree by 
the factor go ~ (?off + '^cs/'^- These internal edges are also 
connected following the PA rule, (iii) Fluctuation and adapta- 
tion: Each vertex disconnects existing edges randomly (resp. 
connects new edges following the PA rule) when the noise 
(Ciit) in Eq.(3)) is chosen to be negative (resp. positive). This 
fluctuation has the variance dg w a'^g. When connecting, 
the PA rule is applied only within the subset of the existing 
vertices consisting of those having more degree than the one 
previously disconnected. This last constraint accounts for the 
adaptation process in our model. Through this adaptation pro- 
cess, the Internet becomes more efficient. 

With this stochastic model, we first measure the degree 
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FIG. 3: Plotof P(ifci(t)/ifci(0)) versus ki{t)/ki{0) for different times 
in terms of y and x defined in the text. The dotted line is our best fit 
of which the peak is located at gd = 0.016 and the standard deviation 
is ad = 0.14. Inset: Plot of the distribution of gi = {Gi)t, Paig), 
versus g = gi for the vertices in the set S. The dashed line is our best 
fit of the central part following the Gaussian distribution of which the 
peak is located at p = 0.016 and the standard deviation is ag — 0.04. 
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FIG. 4: Plot of the conditional probability P{k\k') for the dan- 
gling vertices with k' — 1 (diamonds) and the hub (circles). Data 
points with filled (open) symbols are from the real Internet data 
(the model simulations). The dashed (dotted) line has a slope —1.9 
(—1.1), drawn for a guide to the eye. Inset: Plot of the average de- 
gree of nearest neighbors of a vertex whose degree is k, {k„n) ~ 
y^^i., k'P{k'\k), as a function of k from the model simulation (open 
square) and the real data (filled square). 
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exponent to be 7modei ~ 2.2, close to the empirical result 
7AS ~ 2.2(1). Second, the clustering coefficient is measured 



fund in SNU. 



to be Cr, 



0.15(7), comparable to the empirical value 



Cas ~ 0.25 (see also Ref.|13|). Note that without the adap- 
tation rule, we only get C « 0.01(1). Third, we measure 
the conditional probability, P{k\k'), that the degree of a ver- 
tex is k given that it is connected from a vertex with degree 
k' . For both our model and the real data, it is obtained that 
P{k\k') - fc-i i(i) for small k' and - fc-i-^t^) for large k' 
(see Fig.^. Note that, for linearly growing networks with the 
PA rule, it is known that the probability that vertices of de- 
gree k (ancestor) and k' (descendent) are connected scales as 
~ k~^'^~^^k'~^ iQ, |l6|]. Finally, we measure the average de- 
gree of the nearest neighbors of a vertex whose degree is k, 
(fcnn) = J2k' k'P{k'\k), as a function of k. It exhibits a de- 
caying behavior for large k, in agreement with the observation 
for the real Internet topology, as shown in the inset of Fig.^ 
(see also Ref.|l^. This is in contrast with the fc-independent 
behavior occurring in the BA model. The adaptive feature in 
rule (iii) is crucial to reproduce such detailed agreements be- 
tween our model and the real data: While the degree exponent 
7modoi ~ 2.2 can be obtained without the adaptation rule, 
other results cannot be obtained without it. So adaptation as 
well as fluctuation are essential ingredients to describe the real 
Internet topology correctly. Meanwhile, due to the adaptation 
effect, the network is more centralized to the hub, so that it 
spreads diseases more quickly via the hub [|l7|| and becomes 
more vulnerable to the attacks [ p^ . 

Summary. The Internet topology evolves exponentially in 
the number of vertices and edges as time goes on. The degree 
of each vertex also increases exponentially with time, but its 
growth rate fluctuates strongly from time to time. The effect 
induced by such fluctuations is essential [p^]. This has not 
been incorporated properly so far in most scale-free network 
models. Based on the numerical measurement, we construct a 
stochastic model following the HA idea. In addition, the adap- 
tation process arising in the evolution of edges has been newly 
taken into account in our model, through which we can repro- 
duce the correlations between the degrees of different vertices 
in the real Internet. 
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